wetdog's Repositories
58 repositories
accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
⭐ 0
Public
aiexperiments-bird-sounds
Thousands of bird sounds visualized using machine learning.
⭐ 0
Public
AQI-Catalonia-Challenge
No description
⭐ 0
Public
AudibleLight
Spatial soundscape synthesis using ray-tracing
⭐ 0
Public
audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
⭐ 0
Public
audio-transformers-course
The Hugging Face Course on Transformers for Audio
⭐ 0
Public
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
⭐ 0
Public
AudioNotebooks
Collection of notebooks and scripts related to audio processing and machine learning.
⭐ 0
Public
audioset_experiments
Various experiments with the audioset database and tensorflow
⭐ 0
Public
audio_pipeline
No description
⭐ 0
Public
audio_preprocess
Simple scripts and utils for audio dataset preparation
⭐ 0
Public
cheatsheet-translation
Translation of VIP cheatsheets for Machine Learning and Deep Learning
⭐ 0
Public
cumbiaGEN
Music ai team projects
⭐ 0
Public
dash-case
No description
⭐ 0
Public
dataspeech
No description
⭐ 0
Public
DCASE2017-baseline-system
DCASE 2017 Baseline system
⭐ 0
Public
DCASE_explorations
Experiments with the DCASE framework and database
⭐ 0
Public
e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
⭐ 0
Public
EfficientAT
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.
⭐ 0
Public
Env_soundrecognition
Environmental sound recognition of traffic events such as car, motorcycle, heavy vehicle, and horn using librosa and scikit-learn
⭐ 1
Public
esp32-i2s-slm
Sound Level Meter with ESP32 and I2S MEMS microphone
⭐ 0
Public
faust
Functional programming language for signal processing and sound synthesis
⭐ 0
Public
gigagan-pytorch
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
⭐ 0
Public
insanely-fast-whisper
No description
⭐ 0
Public
introtodeeplearning_labs
Lab Materials for MIT 6.S191: Introduction to Deep Learning
⭐ 0
Public
makemore
An autoregressive character-level language model for making more things
⭐ 0
Public
mems-micros
ICS-43432 mems breakout circular board
⭐ 0
Public
models
Models and examples built with TensorFlow
⭐ 0
Public
musicinformationretrieval.com
Instructional notebooks on music information retrieval.
⭐ 0
Public
open-tts-tracker
No description
⭐ 0
Public
orca-embeddings
Extraction pipelines and experiments with audio embeddings (Jose's GSoC work, 2021)
⭐ 0
Public
p5.js-sound
p5.sound brings the Processing approach to Web Audio and p5.js. Demos:
⭐ 0
Public
PAM
PAM is a no-reference audio quality metric for audio generation tasks
⭐ 0
Public
pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
⭐ 0
Public
psysound3
psysound3 getting backroom surgery to work with MIRtoolbox
⭐ 0
Public
pyfilterbank
Implementing a fractional octave filterbank for python. Based on Numpy and CFFI.
⭐ 0
Public
rpi-loggers
Basic audio and GPS data loggers based on Raspberry Pi
⭐ 0
Public
RPI_SLM
Basic sound level meter on Raspberry Pi using the pyfilterbank library
⭐ 1
Public
scatter-sounds
Web visualization and listening page for sound datasets.
⭐ 3
Public
sliderspace
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models
⭐ 0
Public
stanford-tensorflow-tutorials
This repository contains code examples for Stanford's course: TensorFlow for Deep Learning Research.
⭐ 0
Public
StyleTTS2_fabric
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
⭐ 0
Public
surround-soundscape
No description
⭐ 2
Public
Text-to-speech
No description
⭐ 0
Public
Tidal
Pattern language
⭐ 0
Public
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
⭐ 0
Public
TTS-arxiv-daily
Automatically update Text-to-Speech (TTS) papers daily using GitHub Actions (updates every 12 hours)
⭐ 0
Public
tts-fpitch
An exercise to fine-tune a FastPitch model using the Coqui TTS framework on a specific speaker of the ARCTIC dataset
⭐ 0
Public
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
⭐ 0
Public
video2dataset
Easily create large video dataset from video urls
⭐ 0
Public
vits2_pytorch
unofficial vits2-TTS implementation in pytorch
⭐ 0
Public
vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
⭐ 5
Public
voicefixer
General Speech Restoration
⭐ 0
Public
wavenext_pytorch
Unofficial implementation of wavenext vocoder
⭐ 53
Public
webMUSHRA
a MUSHRA compliant web audio API based experiment software
⭐ 0
Public
wetdog
Page to customize the profile header
⭐ 0
Public
wetdog.github.io
Personal webpage forked from https://academicpages.github.io
⭐ 0
Public
youtube-8m
Starter code for working with the YouTube-8M dataset.
⭐ 0
Public